On Robustness of Principal Component Regression
نویسندگان
چکیده
Principal component regression (PCR) is a simple, but powerful and ubiquitously utilized method. Its effectiveness well established when the covariates exhibit low-rank structure. However, its ability to handle settings with noisy, missing, mixed-valued, that is, discrete continuous, not understood remains an important open challenge. As main contribution of this work, we establish robustness PCR, without any change, in respect provide meaningful finite-sample analysis. To do so, PCR equivalent performing linear after preprocessing covariate matrix via hard singular value thresholding (HSVT). result, context counterfactual analysis using observational data, show recently proposed robust variant synthetic control method, known as (RSC). immediate consequence, obtain RSC estimator was previously absent. controls literature, (approximate) exists setting generalized factor model, or latent variable model; traditionally existence needs be assumed exist axiom. We further discuss surprising implication property noise, can learn good predictive model even if are tactfully transformed preserve differential privacy. Finally, work advances state-of-the-art for HSVT by establishing stronger guarantees l2,?-norm rather than Frobenius norm commonly done estimation which may interest own right.
منابع مشابه
Robust Principal Component Regression
In this note we introduce a method for robust principal component regression. Robust principal components are computed from the predictor variables, and they are used afterwards for estimating a response variable by performing robust linear multiple regression. The performance of the method is evaluated at a test data set from geochemistry. Then it is used for the prediction of censored values ...
متن کاملForecast comparison of principal component regression and principal covariate regression
Forecasting with many predictors is of interest, for instance, in macroeconomics and finance. This paper compares two methods for dealing with many predictors, that is, principal component regression (PCR) and principal covariate regression (PCovR). The forecast performance of these methods is compared by simulating data from factor models and from regression models. The simulations show that, ...
متن کاملBootstrapping Principal Component Regression Models
Bootstrap methods can be used as an alternative for cross-validation in regression procedures such as principal component regression (PCR). Several bootstrap methods for the estimation of prediction errors and confidence intervals are presented. It is shown that bootstrap error estimates are consistent with cross-validation estimates but exhibit less variability. This makes it easier to select ...
متن کاملSketching for Principal Component Regression
Principal component regression (PCR) is a useful method for regularizing linear regression. Although conceptually simple, straightforward implementations of PCR have high computational costs and so are inappropriate when learning with large scale data. In this paper, we propose efficient algorithms for computing approximate PCR solutions that are, on one hand, high quality approximations to the...
متن کاملPenalized Principal Component Regression on Graphs for Analysis of Subnetworks
Network models are widely used to capture interactions among component of complex systems, such as social and biological. To understand their behavior, it is often necessary to analyze functionally related components of the system, corresponding to subsystems. Therefore, the analysis of subnetworks may provide additional insight into the behavior of the system, not evident from individual compo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the American Statistical Association
سال: 2021
ISSN: ['0162-1459', '1537-274X', '2326-6228', '1522-5445']
DOI: https://doi.org/10.1080/01621459.2021.1928513